An Appropriate Abstraction for an Attribute-Oriented Induction

Authors

  • Yoshimitsu Kudoh
  • Makoto Haraguchi
Abstract

Attribute-oriented induction is a useful data mining method that generalizes databases under an appropriate abstraction hierarchy to extract meaningful knowledge. The hierarchy is designed to exclude rules that are meaningless from a particular point of view. However, there may be several ways of generalizing a database, depending on the user's intention. It is therefore important to provide a multi-layered abstraction hierarchy under which several generalizations are possible and well controlled. In fact, databases that are too general or too specific are inappropriate for mining algorithms that extract significant rules. From this viewpoint, this paper proposes a generalization method based on an information-theoretic measure to select an appropriate abstraction hierarchy. Furthermore, we present a system called ITA (Information Theoretical Abstraction), based on our method and on attribute-oriented induction. We perform practical experiments in which ITA discovers meaningful rules from a US Census Bureau census database, and we discuss the validity of ITA based on the experimental results.
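The idea of choosing among competing abstractions with an information-theoretic score can be illustrated with a minimal sketch. The abstract does not state which measure ITA uses, so this example assumes information gain with respect to a target attribute; the function names, the toy occupation data, and the two candidate hierarchies are all hypothetical and are not taken from the paper.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((c / total) * log2(c / total) for c in counts.values())


def information_gain(values, targets):
    """Reduction in target entropy after grouping rows by attribute value."""
    total = len(values)
    groups = {}
    for value, target in zip(values, targets):
        groups.setdefault(value, []).append(target)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(targets) - conditional


def select_abstraction(values, targets, candidates):
    """Score each candidate mapping (concrete value -> abstract concept)
    and return the name of the best one together with all scores.
    Assumption: information gain stands in for the paper's measure."""
    scored = {
        name: information_gain([mapping[v] for v in values], targets)
        for name, mapping in candidates.items()
    }
    best = max(scored, key=scored.get)
    return best, scored


# Hypothetical toy data: an occupation attribute and an income class.
occupations = ["nurse", "surgeon", "teacher", "professor",
               "nurse", "teacher", "surgeon", "professor"]
income = ["low", "high", "low", "high", "low", "low", "high", "high"]

# Two hypothetical ways of generalizing the same attribute.
candidates = {
    "by_field": {"nurse": "medical", "surgeon": "medical",
                 "teacher": "education", "professor": "education"},
    "by_seniority": {"nurse": "junior", "teacher": "junior",
                     "surgeon": "senior", "professor": "senior"},
}

best, scored = select_abstraction(occupations, income, candidates)
print(best, scored)
```

In this toy data, grouping by seniority separates the income classes completely while grouping by field does not separate them at all, so the seniority abstraction is selected. A real system would also have to handle multi-level hierarchies and several attributes at once.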


Similar resources

Fuzzy Global Attribute Oriented Induction

Attribute-oriented induction is a useful technique that summarizes data of relatively-low values with higher-level concepts. In a large database, it is always beneficial to describe generalized knowledge, rather than lower levels of abstraction. In this paper we modify the concept of AOI and we apply it on a fuzzy relational database. Following the concept proposed by Chen et al. [1] to mine ge...


Object-Oriented Database Mining: Use of Object Oriented Concepts for Improving Data Classification Technique

Complex objects are organized into a class/subclass hierarchy where each object attribute may be composed of other complex objects. Most existing work on complex data classification starts by generalizing objects to an appropriate abstraction level before the classification process. Generalization prior to classification produces a less accurate result than integrating generalization into the...


Generalization and Decision Tree Induction: Efficient Classification in Data Mining

Efficiency and scalability are fundamental issues concerning data mining in large databases. Although classification has been studied extensively, few of the known methods take serious consideration of efficient induction in large databases and the analysis of data at multiple abstraction levels. This paper addresses the efficiency and scalability issues by proposing a data classification metho...


Attribute-oriented Induction in Object-oriented Databases

Knowledge discovery in databases is the nontrivial extraction of implicit, previously unknown, and potentially useful information from data such that the extracted knowledge may facilitate deductive reasoning and query processing in database systems. This branch of study has been ranked among the most promising topics for database research for the 1990s. Due to the dominating influence of relat...


Using an Appropriate Controller for Independent Current Control for Motoring of Force Windings of Bearingless Induction Motor

A bearingless induction machine has combined characteristics of induction motor and magnetic bearings. Therefore, the advantages are small size and low-cost. In the magnetic suspension of the bearingless motors, suspension forces are generated based on the feedback signals of displacement sensors detecting the movement of the rotor shaft. The suspension forces are generated taking an advantage ...



Publication date: 1999